NLP Driven Models for Automatically Generating Survey Articles for Scientific Topics
نویسنده
چکیده
This thesis presents new methods that use natural language processing (NLP) driven models for summarizing research in scientific fields. Given a topic query in the form of a text string, we present methods for finding research articles relevant to the topic as well as summarization algorithms that use lexical and discourse information present in the text of these articles to generate coherent and readable extractive summaries of past research on the topic. In addition to summarizing prior research, good survey articles should also forecast future trends. With this motivation, we present work on forecasting future impact of scientific publications using NLP driven features.
منابع مشابه
Surveyor: A System for Generating Coherent Survey Articles for Scientific Topics
We investigate the task of generating coherent survey articles for scientific topics. We introduce an extractive summarization algorithm that combines a content model with a discourse model to generate coherent and readable summaries of scientific topics using text from scientific articles relevant to the topic. Human evaluation on 15 topics in computational linguistics shows that our system pr...
متن کاملGenerating Phrasal and Sentential Paraphrases: A Survey of Data-Driven Methods
The task of paraphrasing is inherently familiar to speakers of all languages. Moreover, the task of automatically generating or extracting semantic equivalences for the various units of language— words, phrases, and sentences—is an important part of natural language processing (NLP) and is being increasingly employed to improve the performance of several NLP applications. In this article, we at...
متن کاملTopics of Disasters in Scientific Outputs of Medical Sciences: A Cross-Sectional Study
Background: Accurate and timely information plays an important role in disaster preparedness and this information is partly obtained through research and scientific articles. This study aimed to evaluate the publication status of scientific articles about disasters and accidents in Iranian Medical Journals from 2010 to 2015. Materials and Methods: All Persian articles on the subject of natural...
متن کاملBringing Structure to Text: Mining Phrases, Entity Concepts, Topics, and Hierarchies
Mining phrases, entity concepts, topics, and hierarchies from massive text corpus is an essential problem in the age of big data. Text data in electronic forms are ubiquitous, ranging from scientific articles to social networks, enterprise logs, news articles, social media and general web pages. It is highly desirable but challenging to bring structure to unstructured text data, uncover underly...
متن کاملExplaining Similarity of Terms
Computing the similarity between entities is a core component of many NLP tasks such as measuring the semantic similarity of terms for generating a distributional thesaurus. In this paper, we study the problem of explaining post-hoc why a set of terms are similar. Given a set of terms, our task is to generate a small set of explanations that best characterizes the similarity of those terms. Our...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015